Quasi-local evolution of cosmic gravitational clustering in weakly non-linear regime

Jesús Pando, Longlong Feng and Li-Zhi Fang
ABSTRACT 

We investigate the weakly non-linear evolution of cosmic gravitational clus- 
tering in phase space by looking at the Zel'dovich solution in the discrete wavelet 
transform (DWT) representation. We show that if the initial perturbations are 
Gaussian, the relation between the evolved DWT mode and the initial perturba- 
tions in the weakly non-linear regime is quasi-local. That is, the evolved density 
perturbations are mainly determined by the initial perturbations localized in the 
same spatial range. Furthermore, we show that the evolved mode is monotoni- 
cally related to the initial perturbed mode. Thus large (small) perturbed modes 
statistically correspond to the large (small) initial perturbed modes. We test 
this prediction by using QSO Lyα absorption samples. The results show that the
weakly non-linear features for both the transmitted flux and identified forest lines 
are quasi-localized. The locality and monotonic properties provide a solid basis 
for a DWT scale-by-scale Gaussianization reconstruction algorithm proposed by 
Feng & Fang (2000) for data in the weakly non-linear regime. With
the Zel'dovich solution, we find also that the major non-Gaussianity caused by 
the weakly non-linear evolution is local scale-scale correlations. Therefore, to 
have a precise recovery of the initial Gaussian mass field, it is essential to remove 
the scale-scale correlations. 

Subject headings: cosmology: theory - large-scale structure of the universe 



1. Introduction 

One of the basic goals of large scale structure study is to reconstruct the initial mass
field of the universe. Assuming observed objects trace the underlying matter field in some
way, it should be possible to recover the initial conditions of the mass field. For instance, if
the probability distribution function (PDF) of the initial mass fluctuations is Gaussian, one
may recover the initial Gaussian mass fluctuations by properly removing all non-Gaussian
features via the Gaussianization reconstruction algorithm (Weinberg 1992).

The key step in the Gaussianization algorithm is a mapping from a smoothed observed
non-Gaussian distribution of the density contrast, $\delta(x) = (\rho - \bar{\rho})/\bar{\rho}$ ($\rho$ is the mass
density, and $\bar{\rho}$ is its mean), into a smoothed initial Gaussian density contrast $\delta_0(x)$. The
basic assumptions of this mapping are that the relation between $\delta(x)$ and the initial density
distribution $\delta_0(x)$ is local and monotonic, i.e., the rank order of the mass density field,
smoothed over a given scale, is preserved even under non-linear gravitational evolution
(Narayanan & Weinberg 1998). Thus, the high initial density pixels will evolve into high
$\delta(x)$ pixels and low initial density pixels into low $\delta(x)$ pixels. With this assumption, the
Gaussian mapping can be realized by a point-to-point, order-preserving transformation. The
shape of the initial Gaussian density field is recovered with an arbitrary normalization.

This method has been employed to recover the initial mass field and power spectrum
from galaxy redshift surveys and from the transmitted flux of the Lyα absorption in QSO
spectra (Croft et al. 1998). The validity of the point-to-point (or pixel-to-pixel) recovery
of the initial Gaussian mass field from the evolved mass field stems from the belief that the
transmitted flux probably is a pixel-to-pixel tracer of the underlying dark matter distribution
(Bi 1993; Fang et al. 1993; Bi, Ge & Fang 1995; Hernquist et al. 1996; Bi & Davidsen 1997;
Hui, Gnedin & Zhang 1997).

However, whether the order-preserving assumption is reasonable is far from clear. It has 
been argued that the order-preserving condition may be a poor approximation to the actual 
dynamics because of the non-locality of gravitational evolution (Monaco & Efstathiou 2000). 
Gravitational clustering is not localized. Even in the weakly non-linear evolutionary regime, 
the processes of cosmic clustering typically are those of objects free falling into potential 
wells (e.g. Xu, Fang & Wu 2000), Fourier mode-mode coupling (e.g. Suto & Sasaki 1991),
and the merging of pre-virialized dark halos. These processes are generally non-local. 

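In the Zel'dovich approximation (Zel'dovich 1970), the density field $\rho(\mathbf{x}, t)$ at (Eulerian)
comoving position $\mathbf{x}$ and time $t$ is determined by the initial perturbation at (Lagrangian)
comoving position, $\mathbf{q}$, plus a displacement $\mathbf{S}$:

$$\mathbf{x}(\mathbf{q}, t) = \mathbf{q} + \mathbf{S}(\mathbf{q}, t). \qquad (1)$$
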
The displacement $\mathbf{S}(\mathbf{q}, t)$ represents the effect of density perturbations on the trajectories
of gravitating particles. Therefore, cosmic self-gravitating systems do not follow a Eulerian
point-to-point localized evolution. Even when the transmitted flux is locally determined by
the evolved underlying dark matter distribution, the relation between the initial mass field
and the transmitted flux might still be non-local. The initial power spectrum reconstructed
from the point-to-point Gaussian mapping of Lyα transmitted flux shows a systematic
suppression on small scales (Croft et al. 1999). This problem may be mitigated by smoothing
the data. However, the smoothing scale is put in by hand and it is hard to understand how
this scale can be determined from the dynamics of the non-local evolution. For these reasons
the basis of the order-preserving transformation and the point-to-point Gaussian mapping
needs to be reconsidered.

In this paper, we will show first that the assumption of point-to-point locality is not
necessary for a proper Gaussianization reconstruction.

Recently, we showed that the transmitted flux Gaussianized by the point-to-point Gaussian
mapping is still largely non-Gaussian (Feng & Fang 2000). That is, the non-Gaussianities
of the original transmitted flux are retained in the mass field recovered by the point-to-point,
order-preserving transformation. Especially troubling is the fact that the scale-scale
correlations of the Gaussianized field are about the same as before the mapping. The mass power
spectrum recovered by the point-to-point Gaussian mapping is systematically lower than
the initial mass spectrum on scales at which the scale-scale correlations of the recovered mass
field are substantial.

To solve this problem, a scale-by-scale algorithm of the Gaussian mapping was proposed
(Feng & Fang 2000). In this algorithm the order-preserving transformation is done on
the coefficients of a space-scale decomposition rather than on each pixel of the transmitted
flux. This method can effectively remove the non-Gaussianities of the observed distribution,
especially clearing the scale-scale correlations. Consequently, small scale suppression
is significantly reduced. Furthermore, the scale-by-scale Gaussianization procedure does not
require a point-to-point order preservation, but only the order preservation of the density
fluctuations of the space-scale decomposed field.

We will show that the order preservation assumption for the scale-by-scale Gaussian
mapping is supported by the Zel'dovich approximation and that in quasi-linear evolution,
the gravitational clustering is quasi-local and monotonic. Moreover, we will find that the
non-Gaussian features during the weakly non-linear regime are dominated by the local
scale-scale correlations. Therefore, at least for the transmitted flux and forest lines of the QSO
Lyα absorption spectrum, the scale-by-scale Gaussianization algorithm has a solid basis in
the weakly non-linear regime.

The paper is organized as follows: Section 2 presents the quasi-locality condition needed
for the scale-by-scale Gaussianization recovery. In Section 3 we show the quasi-locality and
monotonic nature of the weakly non-linear evolution using the Zel'dovich solution of the growth
modes. Section 4 presents the result of measuring the quasi-locality for samples of the Lyα
absorption spectrum. The conclusions are summarized in Section 5.

2. Scale-by-scale Gaussian mapping and the quasi-locality condition 

2.1. Problems with the point-to-point Gaussian mapping 

Let us briefly introduce the algorithm for a point-to-point Gaussian mapping, which is
designed for recovering the initial Gaussian mass density contrast $\delta_0(x)$ from an observed,
non-Gaussian distribution $\delta(x)$. As an example, we consider an observed transmitted flux
$F$ or absorption optical depth $\tau = -\ln F$ of the Lyα absorption in QSO spectra. The
distribution is one-dimensional and of length $L$. The PDF of the observed transmitted flux $F$
is generally non-Gaussian, while the PDF of the initial density contrast $\delta_0$ is assumed to be
Gaussian in a large variety of structure formation models. The relation between $F = e^{-\tau}$
and $\delta_0$ is assumed to be order-preserving, or monotonic, i.e., high initial density $\delta_0$ pixels
evolve into high absorption optical depth $\tau$ pixels, while low initial density pixels evolve into
low $\tau$ pixels. Thus, a point-to-point (or pixel-to-pixel) Gaussian mapping can be done as
follows. First, sort the $N$ pixels in ascending order by the observed amount of flux. Second,
assign to the $n$-th pixel a density contrast $\delta$ by solving the error function equation
$(2\pi)^{-1/2} \int_{-\infty}^{\delta} \exp(-x^2/2)\,dx = n/N$. This pixel-to-pixel Gaussian mapping produces a mass
field with the same rank order as the original flux but with a Gaussian PDF. After determining
the overall amplitude of the Gaussian mapped field by a separate procedure, the initial
density field $\delta_0$ is recovered (Weinberg 1992).
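
The rank-order mapping described above can be illustrated with a short sketch. It is not the authors' code: the function name `gaussianize`, the sample values, and the use of the plotting position (n - 0.5)/N instead of n/N (so that the argument of the inverse error function stays strictly between 0 and 1) are assumptions made for this illustration. The mapping is applied to the optical depth, and the overall amplitude normalization is left out.

```python
from statistics import NormalDist

def gaussianize(values):
    """Rank-order (monotonic) map of `values` onto a unit Gaussian."""
    n_pix = len(values)
    # Pixel indices sorted in ascending order of the observed value.
    order = sorted(range(n_pix), key=lambda i: values[i])
    unit_normal = NormalDist(0.0, 1.0)
    mapped = [0.0] * n_pix
    for rank, pixel in enumerate(order, start=1):
        # Solve Phi(delta) = (rank - 0.5) / N for delta. The -0.5 offset is an
        # assumption that keeps the probability strictly inside (0, 1).
        mapped[pixel] = unit_normal.inv_cdf((rank - 0.5) / n_pix)
    return mapped

# Hypothetical optical depths tau = -ln F for five pixels.
optical_depth = [0.3, 2.5, 0.1, 1.2, 0.7]
delta = gaussianize(optical_depth)
```

The output `delta` has the same rank order as the input, and its values follow the quantiles of a unit Gaussian; only the shape of the field is fixed, not its amplitude.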

However, the density field recovered by the pixel-to-pixel Gaussian mapping still exhibits
non-Gaussian features. In particular, the scale-scale correlations of the Gaussianized
field are almost as strong as those of the pre-Gaussianized flux. The pixel-to-pixel Gaussian
mapping does not remove all the non-Gaussianities and the initial Gaussian field will not be
recovered precisely. The power spectrum recovered by the point-to-point Gaussian mapping is
systematically lower than the initial mass spectrum on scales at which the scale-scale
correlation is substantial. Physically, the effect might just be due to the fact that $\delta$ is simply
too non-linear, and the relation between $\delta$ and $\delta_0$ is no longer monotonic. If this is the case,
then a scale-adaptive smoothing is required before Gaussianization.

However, a smoothing window function sometimes introduces spurious features. For
instance, the aliasing of the power spectrum on scales around the Nyquist frequency
corresponds to the scale of the smoothing grid. With the DWT analysis, however, one need
not introduce window functions for smoothing because the scaling functions of the wavelet
transform already play that role. The scaling function is orthogonal to the relevant wavelets.
False correlations due to the smoothing window can then be perfectly eliminated (Feng &
Fang 2000). Therefore, a scale-adaptive smoothing and reconstruction can be realized by
the scale-by-scale Gaussian mapping.



2.2. Scale-by-scale Gaussian mapping 

Scale-by-scale, or scale-adaptive, Gaussianization can be performed via the discrete
wavelet transform (DWT)^1. The first step in this process is to wavelet expand the flux
$F$ as

$$F(x) = \bar{F} + \sum_{j=0}^{\infty} \sum_{l=0}^{2^j-1} \tilde{\epsilon}_{j,l}\, \psi_{j,l}(x), \qquad (2)$$

where $\psi_{j,l}(x)$, $j = 0, 1, \ldots$, $l = 0 \ldots 2^j - 1$, is an orthogonal and complete basis. The wavelet
basis (mode) $\psi_{j,l}(x)$ is localized both in physical space and in Fourier (scale) space. The
function $\psi_{j,l}(x)$ is centered at position $lL/2^j$ in physical space and at wavenumber
$2\pi \times 2^j/L$ in Fourier space. The wavelet function coefficients (WFCs), $\tilde{\epsilon}_{j,l}$, are labeled by
the two subscripts $j$ and $l$, corresponding to the scale and position respectively. The
$\tilde{\epsilon}_{j,l}$ describe the fluctuation of the flux on scale $L/2^j$ at position $lL/2^j$ and are computed as
the inner product

$$\tilde{\epsilon}_{j,l} = \int F(x)\, \psi_{j,l}(x)\, dx. \qquad (3)$$

The density fields in cosmology are assumed to be ergodic, that is, the average over an
ensemble is equal to the spatial average taken over one realization. This is the so-called fair
sample hypothesis (Peebles 1980). A homogeneous Gaussian field with a continuous power
spectrum is certainly ergodic (Adler 1981). In some non-Gaussian cases, such as homogeneous
and isotropic turbulence, ergodicity also approximately holds (Vanmarcke 1983). Roughly, the
ergodic assumption is reasonable if the spatial correlations decrease sufficiently rapidly with
increasing separation. This property can effectively be used by the DWT because the wavelets
$\psi_{j,l}(x)$ are orthogonal with respect to the position index $l$. The WFCs $\tilde{\epsilon}_{j,l}$ ($l = 0 \ldots 2^j - 1$)
can be considered as $2^j$ independent samplings, without false correlations caused by a
non-orthogonal or redundant decomposition. Thus, for ergodic random fields the $2^j$ WFCs form
a statistical ensemble on scale $j$. So for a quantity $X_j$, the ensemble average is given by

$$\langle X_j \rangle \simeq \frac{1}{2^j} \sum_{l=0}^{2^j-1} X_{j,l}, \qquad (4)$$

where $X_{j,l}$ is the quantity at position $l$. In other words, the distribution of the $2^j$ WFCs is
the one-point distribution of $X_j$.

^1 For the details of the DWT refer to the classic papers by Mallat (1989a, b), Meyer (1992), Daubechies
(1992) and references therein; for physical applications, refer to Fang & Thews (1998) and references
therein.

In short, we have arrived at the scale-by-scale Gaussianization algorithm, which consists
of the following steps: 1.) Sort the $2^j$ WFCs, $\tilde{\epsilon}_{j,l}$, of the flux field $F$ in ascending order. 2.)
Assign to the $2^j$ WFCs the Gaussianized density contrast at position $l$ by solving the
error function equation. In this algorithm, the order-preserving transformation is performed
scale-by-scale. It gives a position($l$)-to-position($l$) Gaussian mapping for each scale $j$. After
a proper amplitude normalization (Feng & Fang 2000), the initial Gaussian density field $\delta_0$ is
reconstructed with the Gaussianized WFCs $\tilde{\epsilon}^g_{j,l}$ (we have added the label $g$ to emphasize
that we are dealing with a Gaussian field).
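
The two steps above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' implementation: it uses the Haar wavelet in place of the Daubechies 4 basis purely to keep the transform short, it uses the plotting position (rank - 0.5)/2^j in the error function equation, it leaves scales with fewer than four coefficients unchanged (the hypothetical `min_scale` parameter), and it omits the amplitude normalization. All function names and the test signal are hypothetical.

```python
import math
from statistics import NormalDist

ROOT2 = math.sqrt(2.0)

def haar_decompose(signal):
    """Orthonormal Haar DWT of a signal of length 2**J.

    Returns (smooth, wfcs): `smooth` is the single coarsest scaling
    coefficient and wfcs[j] holds the 2**j wavelet coefficients on scale j.
    """
    approx = list(signal)
    details = []
    while len(approx) > 1:
        pairs = list(zip(approx[0::2], approx[1::2]))
        details.append([(a - b) / ROOT2 for a, b in pairs])
        approx = [(a + b) / ROOT2 for a, b in pairs]
    details.reverse()  # the finest scale was computed first; store coarsest first
    return approx[0], details

def haar_reconstruct(smooth, wfcs):
    """Inverse of haar_decompose."""
    approx = [smooth]
    for detail in wfcs:
        finer = []
        for a, d in zip(approx, detail):
            finer.extend([(a + d) / ROOT2, (a - d) / ROOT2])
        approx = finer
    return approx

def gaussianize_scale(coeffs):
    """Order-preserving map of the WFCs of one scale onto a unit Gaussian."""
    n = len(coeffs)
    order = sorted(range(n), key=lambda l: coeffs[l])
    unit_normal = NormalDist()
    mapped = [0.0] * n
    for rank, l in enumerate(order, start=1):
        # (rank - 0.5) / n is an assumed plotting position inside (0, 1).
        mapped[l] = unit_normal.inv_cdf((rank - 0.5) / n)
    return mapped

def gaussianize_scale_by_scale(signal, min_scale=2):
    """Apply the rank mapping separately on every scale j >= min_scale."""
    smooth, wfcs = haar_decompose(signal)
    mapped = [gaussianize_scale(c) if j >= min_scale else list(c)
              for j, c in enumerate(wfcs)]
    return smooth, wfcs, mapped

# Hypothetical 16-pixel "flux" F = exp(-tau).
flux = [math.exp(-(1.0 + math.sin(0.9 * i) + 0.5 * math.cos(2.3 * i)))
        for i in range(16)]
smooth, wfcs, gaussianized = gaussianize_scale_by_scale(flux)
```

Here the rank order is preserved among the coefficients of each scale separately, so no pixel-to-pixel ordering of the field itself is imposed.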

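The $\tilde{\epsilon}^g_{j,l}$ are normally distributed with covariance given by

$$\langle \tilde{\epsilon}^g_{j,l}\, \tilde{\epsilon}^g_{j',l'} \rangle = P_{j,l}\, \delta_{j,j'}\, \delta_{l,l'}. \qquad (5)$$

For a homogeneous Gaussian field, $P_{j,l}$ is $l$-independent, i.e., $P_{j,l} = P_j$. Thus, $P_j$ is the power
spectrum of the initial field (Pando & Fang 1998b; Feng & Fang 2000; Fang & Feng 2000).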
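Because the $2^j$ WFCs $\tilde{\epsilon}^g_{j,l}$ form a fair ensemble, the DWT power spectrum $P_j$ of the initial
field can be calculated by

$$P_j = \frac{1}{2^j} \sum_{l=0}^{2^j-1} \left( \tilde{\epsilon}^g_{j,l} - \overline{\tilde{\epsilon}^g_j} \right)^2, \qquad (6)$$

where

$$\overline{\tilde{\epsilon}^g_j} = \frac{1}{2^j} \sum_{l=0}^{2^j-1} \tilde{\epsilon}^g_{j,l}. \qquad (7)$$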
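
A minimal sketch of such an estimate follows, assuming that the DWT power spectrum on scale j is taken as the variance of the 2^j WFCs on that scale, which is what the covariance in eq.(5) and the fair-ensemble average of eq.(4) suggest. The function name and the example coefficients are hypothetical.

```python
def dwt_power_spectrum(wfcs_by_scale):
    """Variance of the wavelet coefficients on each scale j.

    `wfcs_by_scale[j]` is the list of the 2**j coefficients on scale j.
    """
    spectrum = []
    for coeffs in wfcs_by_scale:
        n = len(coeffs)
        mean = sum(coeffs) / n                      # spatial average on scale j
        spectrum.append(sum((c - mean) ** 2 for c in coeffs) / n)
    return spectrum

# Hypothetical coefficients on scales j = 0, 1, 2.
example_wfcs = [[0.5], [1.0, -1.0], [2.0, 0.0, -2.0, 0.0]]
power = dwt_power_spectrum(example_wfcs)
```

With a single coefficient on scale j = 0 the estimate is trivially zero, so in practice only scales with enough coefficients give a meaningful variance.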

In our numerical calculation, we generally use the compactly supported basis Daubechies
4 (D4) (Daubechies 1992). The choice of a particular compactly supported wavelet basis does
not affect the statistical results (Pando & Fang 1996) and the D4 wavelet is chosen as a
matter of convenience. Although the so-called Haar wavelet appears to be simpler, it is not
well behaved in scale space (Fang & Thews 1998).






2.3. An example 

To demonstrate the scale-by-scale Gaussian mapping, we recover the initial Gaussian
mass field of a simulation sample of Lyα forests.

The simulation samples are produced by a semi-analytic model of the intergalactic
medium (Bi 1993; Fang et al. 1993). This model approximately fits most observed features
of the Lyα clouds, including the column density distribution and the number density of the
lines, the distribution of equivalent widths and their redshift dependence, the clustering of
the lines, and even the 3rd- and 4th-order non-Gaussian features (Bi, Ge & Fang 1995; Bi &
Davidsen 1997; Feng & Fang 2000). We produce the simulation samples in the redshift range
$z = 2.066 \sim 2.436$ with 2^^ pixels. The corresponding simulation size in the CDM model is
189.84 $h^{-1}$ Mpc in comoving space. These scales depend weakly on $\Omega$.

Figure 1 shows a typical realization of the simulated Lyα forest, including the flux $F$, the
density contrast of the IGM, $\delta_{IGM}$, the peculiar velocity of the IGM, $V_{pec}$, and the column
density of neutral hydrogen, $N_{HI}$. The recovered IGM density distributions on scales
$10/2^j$ $h^{-1}$ Mpc with $j = 6, 9, 12$ are plotted in Fig. 2. These results show that the density
distributions can be well reconstructed by the DWT algorithm. The small deviation of the
recovered distributions from the original one is due mainly to the effect of the peculiar
velocity. Figure 3 plots the DWT power spectra of the initial and recovered fields. It shows
that the recovered power spectrum is in excellent agreement with the initial one up to scale
$j = 9$ or $k = 10$ $h$ Mpc$^{-1}$. This means that the effect of the peculiar velocity is significant
only on small scales $j > 9$.

However, our goal is to put the scale-by-scale algorithm on a firm foundation, and so
recovering the power spectrum from simulation samples is insufficient for our purposes.
We must investigate the conditions under which the basic assumption of the scale-by-scale
algorithm, the order-preserving transformation, is reasonable.



2.4. Conditions for quasi-locality 

The scale-by-scale Gaussian mapping assumes the order preservation between the WFCs
$\tilde{\epsilon}_{j,l}$ of the evolved field and the initial WFCs $\tilde{\epsilon}^0_{j,l}$ at each scale $j$. For scale $j$, index $l$
corresponds to a spatial range $L/2^j$. The scale-by-scale Gaussianization algorithm does not
require a point-point (or pixel-pixel) locality, but only a locality in the range of $\Delta x = L/2^j$.

For scale $j$, the corresponding wavenumber is $k = 2\pi 2^j/L$ and we have

$$\Delta x \propto 2\pi/k. \qquad (8)$$

That is, the size of the locality $\Delta x$ in the scale-by-scale Gaussian mapping is scale-adaptive.
Only for wavelengths corresponding to one pixel is pixel-pixel locality required.

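Obviously, if the locality condition is valid, the mode-mode correlation will be dominated
by the local term. Thus, one can set a condition for the locality as

$$\frac{\langle \tilde{\epsilon}_{j,l}\, \tilde{\epsilon}_{j,l'} \rangle}{\langle \tilde{\epsilon}^2_{j,l} \rangle} \ll 1, \quad l \neq l'. \qquad (9)$$

For cosmological problems, the second order correlations only depend on the difference
$\Delta l = l' - l$, and eq.(9) can be calculated as

$$\frac{\kappa_{j,j}(\Delta l)}{\kappa_{j,j}(0)} \ll 1, \quad \Delta l \neq 0, \qquad (10)$$

where $\kappa_{j,j}(\Delta l)$ is defined by

$$\kappa_{j,j}(\Delta l) = \frac{1}{2^j} \sum_{l=0}^{2^j-1} \tilde{\epsilon}_{j,l}\, \tilde{\epsilon}_{j,l+\Delta l}. \qquad (11)$$

The ratios (9) or (10) can be used to estimate the error caused by the locality assumption.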
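
As an illustration of how such a locality ratio could be estimated from the WFCs of a single scale, the sketch below computes the spatial average of products of coefficients separated by Δl and normalizes it by the Δl = 0 value. The periodic wrap-around at the end of the sample, the function names, and the use of uncorrelated Gaussian numbers as stand-in coefficients are assumptions of this example, not part of the analysis in the paper.

```python
import random

def kappa(coeffs, delta_l):
    """Spatial average of eps_l * eps_(l + delta_l) on one scale.

    A periodic wrap-around of the position index is assumed.
    """
    n = len(coeffs)
    return sum(coeffs[l] * coeffs[(l + delta_l) % n] for l in range(n)) / n

def locality_ratio(coeffs, delta_l):
    """Ratio kappa(delta_l) / kappa(0); values much less than 1 indicate locality."""
    return kappa(coeffs, delta_l) / kappa(coeffs, 0)

# Stand-in data: 2**10 uncorrelated Gaussian coefficients (perfectly local case).
rng = random.Random(12345)
wfcs = [rng.gauss(0.0, 1.0) for _ in range(1024)]
ratios = {dl: locality_ratio(wfcs, dl) for dl in range(0, 4)}
```

For uncorrelated coefficients the ratio at Δl ≠ 0 scatters around zero with an amplitude of roughly 1/sqrt(2^j), which gives a feeling for the statistical floor of the estimate on each scale.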



3. The evolution of the DWT modes in the Zel'dovich approximation 

In this section, we will show that the conditions eqs.(9) or (10) are reasonable in the weakly
non-linear regime. We use the Zel'dovich solution, which describes the growing mode of the
gravitational clustering in the quasi-linear and even non-linear regime until the variance of
the density field is of order unity.



3.1. The Zel'dovich solution in the DWT representation 

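In the Zel'dovich approximation [eq.(1)], the displacement vector field $\mathbf{S}(\mathbf{q}, t)$ is given
by

$$\mathbf{S}(\mathbf{q}, t) = -b(t)\, \nabla\phi|_{\mathbf{q}}, \qquad (12)$$

where $b(t)$ is the linear growth factor and $\phi(\mathbf{q})$ is the initial irrotational peculiar velocity
potential (Catelan 1995). The Eulerian density field can be described as

$$\rho(\mathbf{x}, t) = \bar{\rho}(t) \int d^3q\, \delta^D[\mathbf{x} - \mathbf{q} - \mathbf{S}(\mathbf{q}, t)], \qquad (13)$$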

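where $\delta^D(\mathbf{x})$ denotes the 3-dimensional Dirac delta function.

In the DWT representation, a density field $\rho(\mathbf{x})$, $\mathbf{x} = (x_1, x_2, x_3)$, in the range
$0 < x_i < L_i$ ($i = 1, 2, 3$) is described by the WFCs

$$\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}(t) = \int \rho(\mathbf{x}, t)\, \psi_{\mathbf{j},\mathbf{l}}(\mathbf{x})\, d\mathbf{x}, \qquad (14)$$

where the $\psi_{\mathbf{j},\mathbf{l}}(\mathbf{x})$, with $\mathbf{j} = (j_1, j_2, j_3)$ and $\mathbf{l} = (l_1, l_2, l_3)$, are again an orthogonal and
complete basis in 3 dimensions, constructed by the direct product of the 1-dimensional
wavelets (Daubechies 1992; Fang & Thews 1999)

$$\psi_{\mathbf{j},\mathbf{l}}(\mathbf{x}) = \psi_{j_1,l_1}(x_1)\, \psi_{j_2,l_2}(x_2)\, \psi_{j_3,l_3}(x_3). \qquad (15)$$

Substituting eqs.(13) and (15) into eq.(14), we have, with the mean density normalized to
$\bar{\rho} = 1$,

$$\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}(t) = \int d^3q\, \psi_{\mathbf{j},\mathbf{l}}[\mathbf{q} - b(t)\nabla\phi]. \qquad (16)$$

This is the solution of the DWT mode $(\mathbf{j}, \mathbf{l})$ evolution in the Zel'dovich approximation.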

3.2. Weakly non-linear evolution of the DWT modes 

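In the weakly non-linear regime, one can expand eq.(16) in a power series with respect
to $b(t)$:

$$\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}(t) = \sum_{n=0}^{\infty} \frac{(-b)^n}{n!} \int d^3q\, (\nabla\phi \cdot \nabla)^n \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}), \qquad (17)$$

where the operator $\nabla$ in $(\nabla\phi \cdot \nabla)$ acts only on $\psi_{\mathbf{j},\mathbf{l}}$. The evolution of the DWT mode
$(\mathbf{j}, \mathbf{l})$ can be calculated from the first few terms of eq.(17). Specifically, the first 4 terms give

1. The $n = 0$ term is

$$\tilde{\epsilon}^{(0)}_{\mathbf{j},\mathbf{l}} = \int \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q})\, d^3q = 0, \qquad (18)$$

where we used the admissibility property of wavelets. This term corresponds to the uniform
background.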

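2. The $n = 1$ term is

$$\tilde{\epsilon}^{(1)}_{\mathbf{j},\mathbf{l}}(t) = -b(t) \int d^3q\, (\nabla\phi \cdot \nabla) \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}) = b(t) \int d^3q\, \nabla^2\phi\, \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}). \qquad (19)$$

Because $\nabla^2\phi$ is equal to the initial density contrast $\delta_0(\mathbf{x})$, we have

$$\tilde{\epsilon}^{(1)}_{\mathbf{j},\mathbf{l}}(t) = b(t) \int d^3q\, \delta_0(\mathbf{q})\, \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}) = b(t)\, \tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}}, \qquad (20)$$

where $\tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}}$ is given by

$$\tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}} = \int d^3x\, \delta_0(\mathbf{x})\, \psi_{\mathbf{j},\mathbf{l}}(\mathbf{x}). \qquad (21)$$

Therefore, $\tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}}$ is the initial density perturbation of mode $(\mathbf{j}, \mathbf{l})$. Equation (20) gives the linear
growth of mode $(\mathbf{j}, \mathbf{l})$ as $b(t)$ is taken to be the growth solution of the density perturbations.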



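3. The $n = 2$ term is

$$\tilde{\epsilon}^{(2)}_{\mathbf{j},\mathbf{l}}(t) = \frac{b^2(t)}{2} \int d^3q\, (\nabla\phi \cdot \nabla)^2 \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}) = -b^2(t) \int d^3q\, \nabla^2\phi\, \nabla\phi \cdot \nabla \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}). \qquad (22)$$

One can also decompose the initial velocity field $\nabla\phi$ into the DWT modes $(\mathbf{j}, \mathbf{l})$ as

$$\tilde{\Phi}^i_{\mathbf{j},\mathbf{l}} = \int d^3x\, \frac{\partial\phi}{\partial x_i}\, \psi_{\mathbf{j},\mathbf{l}}(\mathbf{x}). \qquad (23)$$

Thus, eq.(22) can be rewritten as

$$\tilde{\epsilon}^{(2)}_{\mathbf{j},\mathbf{l}}(t) = -b^2(t)\, \tilde{\epsilon}^0_{\mathbf{j}',\mathbf{l}'}\, \tilde{\Phi}^i_{\mathbf{j}'',\mathbf{l}''} \int d^3q\, \psi_{\mathbf{j}',\mathbf{l}'}(\mathbf{q})\, \psi_{\mathbf{j}'',\mathbf{l}''}(\mathbf{q})\, \frac{\partial}{\partial q_i} \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}), \qquad (24)$$

where the summation conventions for $\mathbf{j}'$, $\mathbf{l}'$, $\mathbf{j}''$, $\mathbf{l}''$ and $i$ are employed. Since the 3-dimensional
wavelets are given by the direct product of 1-dimensional wavelets [eq.(15)], all terms in the
r.h.s. of eq.(24) contain the three-wavelet integrals $\int dx\, \psi_{j',l'}(x)\, \psi_{j'',l''}(x)\, \frac{d}{dx}\psi_{j,l}(x)$. These
integrals are non-zero only if the spatial ranges of the modes $(j', l')$ and $(j'', l'')$ overlap that
of $(j, l)$. For the Daubechies 4 wavelet, $l'$ and $l''$ should not differ from $l$ by more than 2. Thus,
this integral requires that $(l, l', l'')$ be localized.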



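4. The $n = 3$ term is

$$\tilde{\epsilon}^{(3)}_{\mathbf{j},\mathbf{l}}(t) = -\frac{b^3(t)}{3!} \int d^3q\, (\nabla\phi \cdot \nabla)^3 \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}) \qquad (25)$$
$$= \frac{b^3(t)}{2}\, \tilde{\epsilon}^0_{\mathbf{j}',\mathbf{l}'}\, \tilde{\Phi}^i_{\mathbf{j}'',\mathbf{l}''}\, \tilde{\Phi}^k_{\mathbf{j}''',\mathbf{l}'''} \int d^3q\, \psi_{\mathbf{j}',\mathbf{l}'}(\mathbf{q})\, \psi_{\mathbf{j}'',\mathbf{l}''}(\mathbf{q})\, \psi_{\mathbf{j}''',\mathbf{l}'''}(\mathbf{q})\, \frac{\partial^2}{\partial q_i \partial q_k} \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}).$$

Similar to the 3-wavelet integral, the 4-wavelet integrals in eq.(25) require that the modes
$(\mathbf{j}', \mathbf{l}')$, $(\mathbf{j}'', \mathbf{l}'')$, $(\mathbf{j}''', \mathbf{l}''')$ and $(\mathbf{j}, \mathbf{l})$ be localized.

Thus, the behavior of mode $(\mathbf{j}, \mathbf{l})$ in the weakly non-linear regime can be estimated as

$$\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}(t) \simeq \tilde{\epsilon}^{(1)}_{\mathbf{j},\mathbf{l}}(t) + \tilde{\epsilon}^{(2)}_{\mathbf{j},\mathbf{l}}(t) + \tilde{\epsilon}^{(3)}_{\mathbf{j},\mathbf{l}}(t). \qquad (26)$$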
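
The structure of this expansion can be checked numerically in one dimension. The sketch below is only an illustration under loudly stated assumptions: it works in 1-D, it uses the Mexican-hat function as a stand-in admissible wavelet (it is not the orthogonal Daubechies 4 basis used in the paper), and it takes the hypothetical potential φ(q) = cos q, so that the initial density contrast is φ''(q) = -cos q. It compares the full Zel'dovich mode, the integral of ψ[q - bφ'(q)], with the first-order term b∫φ''ψ dq and the second-order term -b²∫φ''φ'ψ' dq.

```python
import math

def psi(q):
    """Mexican-hat wavelet; admissible because it integrates to zero."""
    return (1.0 - q * q) * math.exp(-0.5 * q * q)

def dpsi(q):
    """First derivative of the Mexican-hat wavelet."""
    return (q ** 3 - 3.0 * q) * math.exp(-0.5 * q * q)

def dphi(q):
    """Gradient of the assumed potential phi(q) = cos(q)."""
    return -math.sin(q)

def d2phi(q):
    """Second derivative of phi, i.e. the initial density contrast."""
    return -math.cos(q)

def integrate(f, lo=-12.0, hi=12.0, n=24000):
    """Midpoint-rule integral; the wavelet is negligible outside [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(f(lo + (i + 0.5) * h) for i in range(n))

b = 0.01  # small growth factor: weakly non-linear regime

# Full 1-D Zel'dovich mode: integral of psi[q - b * phi'(q)].
exact = integrate(lambda q: psi(q - b * dphi(q)))
# n = 0 term: vanishes by admissibility.
zeroth = integrate(psi)
# n = 1 term: b times the initial wavelet coefficient.
first = b * integrate(lambda q: d2phi(q) * psi(q))
# n = 2 term: -b^2 times the integral of phi'' phi' psi'.
second = -b * b * integrate(lambda q: d2phi(q) * dphi(q) * dpsi(q))

err_first = abs(exact - first) / abs(first)
err_second = abs(exact - first - second) / abs(first)
```

In this toy setting the linear term alone reproduces the mode to within about one percent for b = 0.01, and adding the second-order term reduces the residual further, as expected for a convergent expansion in b.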
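Considering the locality of the wavelet integrals in eqs.(24) and (25), eq.(26) yields

$$\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}(t) \simeq b(t)\, \tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}} \left[ 1 - b(t)\, \tilde{\Phi}^i_{\mathbf{j}'',\mathbf{l}''} \int d^3q\, \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q})\, \psi_{\mathbf{j}'',\mathbf{l}''}(\mathbf{q})\, \frac{\partial}{\partial q_i} \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}) \right. \qquad (27)$$
$$\left. + \frac{b^2(t)}{2}\, \tilde{\Phi}^i_{\mathbf{j}'',\mathbf{l}''}\, \tilde{\Phi}^k_{\mathbf{j}''',\mathbf{l}'''} \int d^3q\, \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q})\, \psi_{\mathbf{j}'',\mathbf{l}''}(\mathbf{q})\, \psi_{\mathbf{j}''',\mathbf{l}'''}(\mathbf{q})\, \frac{\partial^2}{\partial q_i \partial q_k} \psi_{\mathbf{j},\mathbf{l}}(\mathbf{q}) \right].$$

If the initial density and velocity perturbations are Gaussian, we have

$$\langle \tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}} \rangle = \langle \tilde{\Phi}^i_{\mathbf{j},\mathbf{l}} \rangle = 0 \qquad (28)$$

and

$$\langle \tilde{\epsilon}^0_{\mathbf{j}',\mathbf{l}'}\, \tilde{\epsilon}^0_{\mathbf{j}'',\mathbf{l}''} \rangle = \langle \tilde{\Phi}^i_{\mathbf{j}',\mathbf{l}'}\, \tilde{\Phi}^i_{\mathbf{j}'',\mathbf{l}''} \rangle = 0, \qquad (29)$$

if $\mathbf{j}' \neq \mathbf{j}''$ or $\mathbf{l}' \neq \mathbf{l}''$.

All higher order cumulants of $\tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}}$ and $\tilde{\Phi}^i_{\mathbf{j},\mathbf{l}}$ are also zero.

Thus, in the bracketed expression on the r.h.s. of eq.(27), the second term is zero on
average because, for the initial Gaussian velocity perturbations, $\langle \tilde{\Phi}^i_{\mathbf{j}'',\mathbf{l}''} \rangle = 0$. The third term in
the bracket is proportional to the variance of $\tilde{\Phi}^i_{\mathbf{j},\mathbf{l}}$, and then to the variance of $\tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}}$. Therefore,
large initial WFCs statistically evolve into large $\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}(t)$, and low initial WFCs into low
WFCs. The evolution is monotonic.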



3.3. Quasi-locality of second order correlations in phase space 

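The locality can be expressed by the mode-mode correlations. For the second order
correlation between modes $(\mathbf{j}^1, \mathbf{l}^1)$ and $(\mathbf{j}^2, \mathbf{l}^2)$, the non-zero terms up to order $b^4$ are

$$\langle \tilde{\epsilon}_{\mathbf{j}^1,\mathbf{l}^1}(t)\, \tilde{\epsilon}_{\mathbf{j}^2,\mathbf{l}^2}(t) \rangle = \langle \tilde{\epsilon}^{(1)}_{\mathbf{j}^1,\mathbf{l}^1} \tilde{\epsilon}^{(1)}_{\mathbf{j}^2,\mathbf{l}^2} \rangle + \langle \tilde{\epsilon}^{(2)}_{\mathbf{j}^1,\mathbf{l}^1} \tilde{\epsilon}^{(2)}_{\mathbf{j}^2,\mathbf{l}^2} \rangle + \langle \tilde{\epsilon}^{(1)}_{\mathbf{j}^1,\mathbf{l}^1} \tilde{\epsilon}^{(3)}_{\mathbf{j}^2,\mathbf{l}^2} \rangle + \langle \tilde{\epsilon}^{(3)}_{\mathbf{j}^1,\mathbf{l}^1} \tilde{\epsilon}^{(1)}_{\mathbf{j}^2,\mathbf{l}^2} \rangle. \qquad (30)$$

Written out with eqs.(20), (24) and (25), the first term is of order $b^2(t)$, while the remaining
terms are of order $b^4(t)$ and contain, respectively, a product of two 3-wavelet integrals and a
4-wavelet integral together with its counterpart under $(\mathbf{j}^1, \mathbf{l}^1) \leftrightarrow (\mathbf{j}^2, \mathbf{l}^2)$.

The $b^2(t)$ term is the linear term and the $b^4(t)$ terms give the first non-linear correction.
Due to the locality of the 3- and 4-wavelet integrals, eq.(30) yields, for modes at different
positions,

$$\langle \tilde{\epsilon}_{\mathbf{j}^1,\mathbf{l}^1}(t)\, \tilde{\epsilon}_{\mathbf{j}^2,\mathbf{l}^2}(t) \rangle \simeq 0. \qquad (31)$$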

The second order correlations between the DWT modes $(\mathbf{j}, \mathbf{l})$ are localized [eqs.(9) and (10)].

Since the 3- and 4-wavelet integrals do not require that scale $j$ be the same as $j'$, $j''$,
and $j'''$, there exist correlations between modes on different scales but localized in the same
spatial range. That is,

$$\langle \tilde{\epsilon}_{\mathbf{j}^1,\mathbf{l}}\, \tilde{\epsilon}_{\mathbf{j}^2,\mathbf{l}} \rangle \neq 0. \qquad (32)$$

This second order correlation actually is a non-Gaussian correlation which corresponds to
the non-random phase of the Fourier modes, or the phase-phase correlations between the
Fourier modes (Feng & Fang 2000).

Moreover, the spatial size of a mode on scale $j$ is $L/2^j$. Thus, the scale-scale correlations
between $j'$ and $j$ lead to a dependence between perturbations on the spatial ranges $L/2^{j'}$ and
$L/2^j$. The two perturbations are not perfectly localized in the same area, and therefore, the
coupling between the DWT modes is quasi-local. In the fully developed non-linear regime,
the strong scale-scale correlations will finally lead to non-local correlations of the DWT
modes.



3.4. Non-Gaussianity of third-order correlations 

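The non-Gaussianities in higher-order correlations caused by the weakly non-linear
evolution can be calculated from the cumulants of $\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}$. Using eq.(26), the third order correlation
up to order $b^4$ is

$$\langle \tilde{\epsilon}_{\mathbf{j}^1,\mathbf{l}^1} \tilde{\epsilon}_{\mathbf{j}^2,\mathbf{l}^2} \tilde{\epsilon}_{\mathbf{j}^3,\mathbf{l}^3} \rangle = \langle \tilde{\epsilon}^{(1)}_{\mathbf{j}^1,\mathbf{l}^1}(t)\, \tilde{\epsilon}^{(1)}_{\mathbf{j}^2,\mathbf{l}^2}(t)\, \tilde{\epsilon}^{(1)}_{\mathbf{j}^3,\mathbf{l}^3}(t) \rangle \qquad (33)$$
$$+ [\langle \tilde{\epsilon}^{(1)}_{\mathbf{j}^1,\mathbf{l}^1}(t)\, \tilde{\epsilon}^{(1)}_{\mathbf{j}^2,\mathbf{l}^2}(t)\, \tilde{\epsilon}^{(2)}_{\mathbf{j}^3,\mathbf{l}^3}(t) \rangle + \text{2 terms with cyc. permutations}].$$

For Gaussian initial perturbations [eq.(28)], we have

$$\langle \tilde{\epsilon}^{(1)}_{\mathbf{j}^1,\mathbf{l}^1}(t)\, \tilde{\epsilon}^{(1)}_{\mathbf{j}^2,\mathbf{l}^2}(t)\, \tilde{\epsilon}^{(1)}_{\mathbf{j}^3,\mathbf{l}^3}(t) \rangle \propto \langle \tilde{\epsilon}^0_{\mathbf{j}^1,\mathbf{l}^1}\, \tilde{\epsilon}^0_{\mathbf{j}^2,\mathbf{l}^2}\, \tilde{\epsilon}^0_{\mathbf{j}^3,\mathbf{l}^3} \rangle = 0. \qquad (34)$$

Using eq.(24) we have

$$\langle \tilde{\epsilon}^{(1)}_{\mathbf{j}^1,\mathbf{l}^1}(t)\, \tilde{\epsilon}^{(1)}_{\mathbf{j}^2,\mathbf{l}^2}(t)\, \tilde{\epsilon}^{(2)}_{\mathbf{j}^3,\mathbf{l}^3}(t) \rangle \propto \qquad (35)$$
$$\langle \tilde{\epsilon}^0_{\mathbf{j}^1,\mathbf{l}^1}\, \tilde{\epsilon}^0_{\mathbf{j}^2,\mathbf{l}^2}\, \tilde{\epsilon}^0_{\mathbf{j}',\mathbf{l}'}\, \tilde{\Phi}^i_{\mathbf{j}'',\mathbf{l}''} \rangle \int d^3q\, \psi_{\mathbf{j}',\mathbf{l}'}(\mathbf{q})\, \psi_{\mathbf{j}'',\mathbf{l}''}(\mathbf{q})\, \frac{\partial}{\partial q_i} \psi_{\mathbf{j}^3,\mathbf{l}^3}(\mathbf{q}) = 0.$$

Therefore, the largest two terms of eq.(33) are zero and we have

$$\langle \tilde{\epsilon}_{\mathbf{j}^1,\mathbf{l}^1} \tilde{\epsilon}_{\mathbf{j}^2,\mathbf{l}^2} \tilde{\epsilon}_{\mathbf{j}^3,\mathbf{l}^3} \rangle \simeq 0. \qquad (36)$$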
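That is, the one-point distribution of the DWT modes $(\mathbf{j}, \mathbf{l})$ contains at most a small skewness.

This result can also be obtained from the hierarchical clustering or linked-pair approximation
as

$$\langle \delta(\mathbf{x}^1)\delta(\mathbf{x}^2)\delta(\mathbf{x}^3) \rangle \simeq Q_3 [\langle \delta(\mathbf{x}^1)\delta(\mathbf{x}^2) \rangle \langle \delta(\mathbf{x}^2)\delta(\mathbf{x}^3) \rangle \qquad (37)$$
$$+ \text{2 terms with cyc. permutations}],$$

where $\delta(\mathbf{x}) = [\rho(\mathbf{x}) - \bar{\rho}]/\bar{\rho}$. Since the wavelet is admissible, i.e., $\int \psi_{j,l}(x)\,dx = 0$, eq.(14) gives

$$\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}(t) = \bar{\rho} \int \delta(\mathbf{x})\, \psi_{\mathbf{j},\mathbf{l}}(\mathbf{x})\, d\mathbf{x}. \qquad (38)$$

We can take $\bar{\rho} = 1$, i.e., the mean density is normalized.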

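Expressing eq.(38) in the wavelet basis $\psi_{\mathbf{j}^1,\mathbf{l}^1}(\mathbf{x}^1)\, \psi_{\mathbf{j}^2,\mathbf{l}^2}(\mathbf{x}^2)\, \psi_{\mathbf{j}^3,\mathbf{l}^3}(\mathbf{x}^3)$, we have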

i',i" 

+ 3 terms with eye. permutations] . 
where $a^j_{l_1,l'_1,l''_1}$ is given by the 3-wavelet integral,

$$a^j_{l_1,l'_1,l''_1} = \int \psi_{j,l_1}(x)\, \psi_{j,l'_1}(x)\, \psi_{j,l''_1}(x)\, dx. \qquad (40)$$

$a^j_{l_1,l'_1,l''_1}$ generally is small and is exactly equal to zero for the Haar wavelets (Mallat 1989a;
Meyer 1992). Therefore, a hierarchically clustered field is only weakly skewed. Note, however,
that this does not imply that the skewness in the DWT representation is small in general.

3.5. Non-Gaussianity of fourth-order correlations 

In contrast, the 4th-order correlations of the WFCs given by eq.(26) are generally non-zero
at order $b^4$. The first significant non-zero higher order term not caused by p > is the
kurtosis defined by

$$K_j = \frac{\langle \tilde{\epsilon}^4_{j,l} \rangle}{\langle \tilde{\epsilon}^2_{j,l} \rangle^2} - 3. \qquad (41)$$

The kurtosis can be considered as a special case ($j = j'$ and $\Delta l = 0$) of the fourth order
scale-scale correlations defined by

$$C^{2,2}_{j,j'}(\Delta l) = \frac{\langle \tilde{\epsilon}^2_{j,l}\, \tilde{\epsilon}^2_{j',l+\Delta l} \rangle}{\langle \tilde{\epsilon}^2_{j,l} \rangle \langle \tilde{\epsilon}^2_{j',l} \rangle}. \qquad (42)$$

$C^{2,2}_{j,j'}(\Delta l)$ measures the correlations between the perturbations of modes on scales $j$ and $j'$
at two positions separated by a distance $\Delta l L/2^j$. In the case of $\Delta l = 0$, $C^{2,2}_{j,j'}(0)$ measures the
correlation between fluctuations on scales $j$ and $j'$ at the same physical point.

Using the locality of the 4-wavelet integral in eq.(25), one can show that the scale-scale
correlations in the weakly non-linear regime are also localized, i.e.,

$$C^{2,2}_{j,j'}(\Delta l \neq 0) < C^{2,2}_{j,j'}(0). \qquad (43)$$

In the weakly non-linear regime, the phase space behavior of gravitational clustering for 
an initially Gaussian mass field can be summarized as 

1. The evolution is spatially quasi-localized, i.e., the correlations between WFCs at
different positions are always less than that at the same position.

2. The density field contains a very small skewness. 

3. The major non-Gaussianities are local scale-scale correlations. 

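It should be pointed out that if a distribution possesses the above-listed features, it does
not mean that the initial mass field is Gaussian. Actually, for a distribution to exhibit these
features, we only need the following conditions on the initial density perturbation:

$$\langle \tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}}\, \tilde{\epsilon}^0_{\mathbf{j}',\mathbf{l}'} \rangle = 0, \qquad (44)$$

when the spatial range of $\mathbf{l}$ doesn't overlap $\mathbf{l}'$, and

$$\langle \tilde{\epsilon}^0_{\mathbf{j},\mathbf{l}}\, \tilde{\epsilon}^0_{\mathbf{j}',\mathbf{l}'} \rangle \neq 0, \qquad (45)$$

when the spatial range of $\mathbf{l}$ does overlap $\mathbf{l}'$. The conditions given by eqs.(44) and (45) are
weaker than eqs.(28) and (29). The initial field given by eqs.(44) and (45) can be
non-Gaussian.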

Equations (44) and (45) show once again that the statistical properties of a random
field should be described by two types of correlations: one with respect to scale $j$ and the
other with respect to position $l$. These two types of correlations correspond to correlations
with respect to the phase and amplitude in the Fourier representation. It is, in fact, possible
to construct clustering models which lead to a density field with a Poisson or Gaussian PDF
with respect to $l$, but that are highly scale-scale correlated (Greiner, Lipa & Carruthers
1995).

3.6. Redshift space

If the density field is viewed in redshift space, the observed radial position is given by
the radial velocity consisting of the uniform Hubble flow and the peculiar motion $\mathbf{v}(\mathbf{x})$. Thus
the position $\mathbf{x}$ in eq.(1) should be replaced by

$$\mathbf{s} = \mathbf{x} + [\hat{\mathbf{z}} \cdot \mathbf{v}(\mathbf{x})]\hat{\mathbf{z}}, \qquad (46)$$

where $\hat{\mathbf{z}}$ is taken to be in the direction of the line of sight. The second term in eq.(46) is the
correction due to the radial peculiar velocity.

In redshift space the Zel'dovich solution eq.(13) is replaced by (e.g. Taylor & Hamilton
1996)

$$\rho(\mathbf{s}, t) = \bar{\rho}(t) \int d^3q\, \delta^D[\mathbf{s} - \mathbf{q} - \mathbf{S}^s(\mathbf{q}, t)]. \qquad (47)$$

In the linear approximation of the velocity field, the displacement vector field is

$$\mathbf{S}^s(\mathbf{q}) = \mathbf{S}(\mathbf{q}) + \Omega^{0.6}[\hat{\mathbf{z}} \cdot \mathbf{S}(\mathbf{q})]\hat{\mathbf{z}}, \qquad (48)$$

where $\Omega$ is the cosmological density parameter. Thus, the solution [eq.(16)] becomes

$$\tilde{\epsilon}_{\mathbf{j},\mathbf{l}}(t) = \int d^3q\, \psi_{\mathbf{j},\mathbf{l}}[\mathbf{q} + \mathbf{S}^s(\mathbf{q}, t)]. \qquad (49)$$

This is the Zel'dovich solution for the DWT modes in redshift space. Mathematically, eq.(49)
is equivalent to eq.(16), with $\mathbf{S}^s(\mathbf{q}, t)$ replacing $\mathbf{S}(\mathbf{q}, t)$. Therefore, all results drawn from
eq.(16) still hold for eq.(49) if the intersection of the particle trajectories has not yet occurred.

It is known that the non-linear effect of the redshift distortion, like the Fingers-of-God
in the galaxy distribution in redshift space, cannot be modeled by the Zel'dovich solution. In
other words, the non-linear effects of the redshift distortion are not negligible if the Zel'dovich
solution fails. Thus, the scale on which the quasi-locality assumption is no longer correct is
also the scale where the non-linear redshift distortion effects emerge. This result is consistent
with the numerical example shown in §2.3, which shows that on scales as small as $j = 9$, the
scale-by-scale reconstruction is still very good, and the redshift distortion effects are not yet
significant.

4. Quasi-locality in the Lyα forests

To test the predictions on the quasi-locality of the DWT mode-mode correlations (§3.5),
we analyze samples of the QSO Lyα absorption spectrum which are believed to trace the
underlying mass field in the weakly non-linear regime. The second point listed in §3.5 has
been tested by our previous studies, which showed no skewness above a positive definite
random distribution for the distributions of Lyα forest lines (Pando & Fang 1998a) or the
transmitted flux (Feng & Fang 2000). In this section, we will focus on points 1 and 3.

The QSO Lyα absorption spectrum samples used in our analysis are: 1.) the transmitted
flux of HS1700+64 Lyα absorptions; 2.) the Lyα forest lines from moderate resolution
spectra, including a sample compiled by Lu, Wolfe, & Turnshek (LWT) (1991) that contains
~ 950 lines from the spectra of 38 QSOs, and a sample compiled by Bechtold (JB)
(1994) which contains a total of ~ 2800 lines from the spectra of 78 QSOs, in which 34
high-redshift QSOs were observed; and 3.) the Lyα forest lines from high resolution spectra,
including the data of Hu et al. (HU) (1995), which has ~ 1056 lines from four quasars in the
wavelength range 4300-5100 Å measured with the HIRES spectrograph on the Keck telescope,
and the HIRES data of QSO HS1946+7658 of Kirkman & Tytler (KT) (1997).

The simulation samples used in this section are the same ones used in §2.3. 

4.1. Quasi-locality of the second order DWT mode-mode correlation

Figure 4 shows $\kappa_{j,j}(\Delta l)$ for 100 realizations of the simulated Lyα transmitted flux and
QSO HS1700+64. Here we use the Daubechies 4 wavelets in our calculations as these wavelets
are better behaved in scale space. Figure 4 shows clearly that the condition for locality
eq.(13) holds. The correlations $\kappa_{j,j}(\Delta l)$ for all $\Delta l \neq 0$ are sharply lower than $\kappa_{j,j}(0)$. For
small scales $j > 5$, at the 95% confidence level, the non-local correlations are no more than
20%.
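
As a rough illustration of how such a normalized second order correlation could be estimated on one scale, the sketch below averages products of wavelet coefficients separated by $\Delta l$ positions. The function name, the periodic wrap-around in position, and the simple position average are assumptions made for this example; the estimator behind eq.(13) may differ in detail.

```python
def mode_mode_correlation(wfc, max_shift):
    """Estimate kappa_{j,j}(dl) for dl = 0..max_shift on a single scale j.

    wfc: list of wavelet coefficients on scale j (one value per position l).
    Assumption: products are averaged over l with periodic wrap-around,
    and the result is normalized so that kappa(0) = 1.
    """
    n = len(wfc)
    raw = []
    for dl in range(max_shift + 1):
        # Average of products of coefficients separated by dl positions.
        total = sum(wfc[l] * wfc[(l + dl) % n] for l in range(n))
        raw.append(total / n)
    # Normalize by the local (dl = 0) term.
    return [value / raw[0] for value in raw]


if __name__ == "__main__":
    coefficients = [0.8, -1.2, 0.3, 1.5, -0.7, 0.1, -0.9, 0.4]
    print(mode_mode_correlation(coefficients, 3))
```

Under this convention, a quasi-local field would give values well below 1 for every non-zero shift.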

Figure 5 shows the correlation $\kappa_{j,j+1}(\Delta l)$, the second order correlation between scales
$j$ and $j+1$. Compared with $\kappa_{j,j}(0)$, the second order non-local scale-scale correlations
($\Delta l \neq 0$) show no power. For the local case, $\kappa_{j,j+1}(0)$ generally also has no power. But
it is interesting to note that $\kappa_{j,j+1}(0)$ becomes significant on very small scales $j = 9, 10$.
This is a second order non-Gaussian detection. One can similarly detect other second order
scale-scale correlations, such as $(j, j+2)$, $(j, j+3)$, etc., with similar results.

Figures 6 and 7 are similar to Fig. 4 but for the LWT/JB and HU/KT Lyα forest line
samples. Although the identification of absorption lines from the transmitted flux may introduce
an arbitrary bias, the LWT/JB and HU/KT data show the same behavior as $\kappa_{j,j}(\Delta l)$ in Fig.
4. For scales $j > 5$, the locality eq. (10) is a very good approximation, even considering the
large errors in the data. For each QSO, we calculate the statistical quantity $\kappa_{j,j+1}(\Delta l)$, and
the mean and 1-σ error bars are found from the ensemble of these QSOs. Since there are
more QSOs in the LWT/JB data, the error bars for the LWT/JB data are smaller than those for
the high resolution HU/KT data.
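
The ensemble step described above can be sketched as follows. This is a minimal example assuming the error bar is the sample standard deviation across QSOs at each shift; the function name and input layout are hypothetical, and the paper may instead use a different convention (for instance the error of the mean).

```python
from statistics import mean, stdev


def ensemble_mean_and_sigma(per_qso_values):
    """Combine per-QSO correlation curves into a mean curve with 1-sigma bars.

    per_qso_values: list of lists, one list per QSO, indexed by the shift dl.
    Assumption: the 1-sigma bar is the sample standard deviation over QSOs,
    which needs at least two QSOs.
    """
    n_shifts = len(per_qso_values[0])
    means, sigmas = [], []
    for dl in range(n_shifts):
        # Collect the value at this shift from every QSO.
        column = [values[dl] for values in per_qso_values]
        means.append(mean(column))
        sigmas.append(stdev(column))
    return means, sigmas


if __name__ == "__main__":
    curves = [[1.0, 0.20, 0.05], [1.0, 0.10, -0.02], [1.0, 0.15, 0.01]]
    print(ensemble_mean_and_sigma(curves))
```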

Figures 8 and 9 are similar to Fig. 5 but again for the Lyα forest lines. The results
once again show that the behavior of the second order scale-scale correlations for the forest
line samples is the same as that for the transmitted flux samples. There is no power in these
second order scale-scale correlations.



4.2. Quasi-locality of 4th order non-Gaussianity 

In the previous section, we detected the local scale-scale correlation from the second 
order statistics. The local correlation of the 4th-order statistics is more prominent. In our 
previous studies, we have shown that the Lyα forests are significantly non-Gaussian at 4th-order,
including the kurtosis (Pando & Fang 1998a) and the scale-scale correlations (Pando
et al. 1998, Feng & Fang 2000). In this section, we will show that although the 4th-order 
local correlations are strong, the non-local correlations are weak. 

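The 4th-order statistic in Figure 10 is defined by

$$Q_j^{2,2}(\Delta l) = \frac{2^j \sum_{l=0}^{2^j-1} \tilde{\delta}_{j,l}^{2}\, \tilde{\delta}_{j,l+\Delta l}^{2}}{\sum_{l=0}^{2^j-1} \tilde{\delta}_{j,l}^{2}\; \sum_{l=0}^{2^j-1} \tilde{\delta}_{j,l+\Delta l}^{2}}.$$

For the local case, i.e., $\Delta l = 0$, $Q_j^{2,2}(0)$ is the compact form of the kurtosis

$$K_j = \frac{2^j \sum_{l=0}^{2^j-1} \tilde{\delta}_{j,l}^{4}}{\left[\sum_{l=0}^{2^j-1} \tilde{\delta}_{j,l}^{2}\right]^{2}}. \qquad (51)$$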



Figure 10 shows that the non-local correlation $Q_j^{2,2}(\Delta l)$ with $\Delta l \neq 0$ of the transmitted
flux is generally around 1, i.e., no correlation. On small scales $j > 5$ the local correlation
$Q_j^{2,2}(0)$ gradually becomes significant, while the non-local correlation $Q_j^{2,2}(\Delta l)$ with $\Delta l \neq 0$
still remains at 1. This result supports the quasi-locality condition.
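
To make the 4th-order statistic concrete, the sketch below computes it for one scale, taking it to be $2^j$ times the sum over positions of the product of squared wavelet coefficients at $l$ and $l+\Delta l$, divided by the product of the two sums of squared coefficients. The function name and the periodic wrap-around at the sample boundary are assumptions made for this example.

```python
def q22(wfc, dl):
    """4th-order same-scale correlation Q_j^{2,2}(dl) on one scale j.

    wfc: list of wavelet coefficients on scale j (length 2**j).
    Assumption: positions l + dl wrap around periodically.
    """
    n = len(wfc)  # number of positions on this scale, 2**j
    squared = [c * c for c in wfc]
    shifted = [squared[(l + dl) % n] for l in range(n)]
    numerator = n * sum(a * b for a, b in zip(squared, shifted))
    denominator = sum(squared) * sum(shifted)
    return numerator / denominator


if __name__ == "__main__":
    coefficients = [0.8, -1.2, 0.3, 1.5, -0.7, 0.1, -0.9, 0.4]
    print([q22(coefficients, dl) for dl in range(4)])
```

With this normalization, squared coefficients that are uncorrelated between positions give values near 1 for non-zero shifts, while `q22(wfc, 0)` reduces to the kurtosis-like ratio of the fourth moment to the squared second moment.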

Figure 11 displays $C_j^{2,2}(\Delta l)$, which is the scale-scale correlation between scales $j$ and
$j + 1$, and shows that the non-local scale-scale correlation $C_j^{2,2}(\Delta l)$ with $\Delta l \neq 0$ is generally
around 1, i.e., no correlation. When the local scale-scale correlation becomes significant on
scales $j > 7$, the non-local correlation $C_j^{2,2}(\Delta l)$ with $\Delta l \neq 0$ still remains at 1.
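
The definition of the scale-scale statistic is not restated in this section, so the following is only a sketch under an assumed form: each coefficient on scale $j+1$ at position $l+\Delta l$ is paired with the coefficient on scale $j$ at position $\lfloor l/2 \rfloor$, and the sum of products of their squares is normalized by the product of the separate sums. The function name, the pairing rule, and the periodic wrap-around are all assumptions.

```python
def c22(wfc_j, wfc_j1, dl):
    """Assumed form of the scale-scale correlation C_j^{2,2}(dl).

    wfc_j:  wavelet coefficients on scale j (length 2**j).
    wfc_j1: wavelet coefficients on scale j + 1 (length 2**(j + 1)).
    Assumption: position l on scale j + 1 sits under position l // 2 on
    scale j, and shifted positions wrap around periodically.
    """
    n1 = len(wfc_j1)
    if n1 != 2 * len(wfc_j):
        raise ValueError("scale j + 1 must have twice as many coefficients")
    parent_sq = [wfc_j[l // 2] ** 2 for l in range(n1)]
    child_sq = [wfc_j1[(l + dl) % n1] ** 2 for l in range(n1)]
    numerator = n1 * sum(a * b for a, b in zip(parent_sq, child_sq))
    denominator = sum(parent_sq) * sum(child_sq)
    return numerator / denominator


if __name__ == "__main__":
    coarse = [1.0, -0.2, 0.5, 0.1]
    fine = [0.9, -1.1, 0.1, 0.2, -0.4, 0.6, 0.0, 0.1]
    print([c22(coarse, fine, dl) for dl in range(3)])
```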

Unlike the second order scale-scale correlations, the 4th-order correlations $Q_j^{2,2}(0)$ and
$C_j^{2,2}(0)$ are significant. There is substantial power on all scales $j > 5$.

Figures 12 and 13 show the scale-scale correlations for the LWT/JB and HU/KT Lyα
forest lines, respectively. Figure 12 clearly shows a steady decrease in the scale-scale
amplitudes with $\Delta l$. This again demonstrates the quasi-locality. Fig. 13 also shows a decrease in
the scale-scale amplitudes with $\Delta l$, but it is rather slow. Even when $\Delta l = 3$, the correlation
remains. These non-local correlations may be caused by problems in identifying the lines
and, if so, this indicates that the transmitted flux is better suited than the Lyα forest lines for a
Gaussianization reconstruction.



5. Conclusions and discussions 

In this work we have demonstrated that the assumptions necessary for reconstructing
an initial density field by a scale-dependent Gaussianization procedure are justified. The
assumptions, namely the locality and monotonic nature of the gravitational clustering, are
borne out by both the Zel'dovich approximation and correlation analysis of QSO Lyα samples.
That is, the DWT mode-mode coupling caused by the weakly non-linear evolution
of the gravitational clustering is quasi-local, and large initial WFCs evolve into
large WFCs of the observed density field. This result provides a solid basis for the DWT
scale-by-scale Gaussianization reconstruction of Lyα forests.
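
The role of the monotonic (order preserving) assumption can be illustrated with a generic rank-order Gaussianization applied to the wavelet coefficients of a single scale. This is a minimal sketch, not necessarily the algorithm of Feng & Fang (2000): the function name is hypothetical, the target is a unit-variance Gaussian, and the amplitude normalization on each scale is left out.

```python
from statistics import NormalDist


def gaussianize_scale(wfc):
    """Map coefficients on one scale to Gaussian values, preserving rank order.

    wfc: list of wavelet coefficients on scale j.
    Assumption: the target is a standard normal distribution; rescaling to
    the desired variance on each scale is left to the caller.
    """
    n = len(wfc)
    target = NormalDist()  # zero mean, unit variance
    # Indices of the coefficients sorted from smallest to largest value.
    order = sorted(range(n), key=lambda i: wfc[i])
    gaussianized = [0.0] * n
    for rank, index in enumerate(order):
        # The k-th smallest coefficient receives the Gaussian quantile at
        # the midpoint probability (k + 0.5) / n, so ordering is unchanged.
        gaussianized[index] = target.inv_cdf((rank + 0.5) / n)
    return gaussianized


if __name__ == "__main__":
    coefficients = [3.0, -1.0, 10.0, 0.2, -4.5]
    print(gaussianize_scale(coefficients))
```

Because the mapping only uses ranks, the largest coefficient on a scale stays the largest after the transformation, which is the property the monotonic relation between initial and evolved modes is meant to justify.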

The non-Gaussianities produced by the mode-mode coupling during the weakly non-linear
regime are mainly local scale-scale correlations. This result explains why the Gaussianization
reconstruction is substantially improved by removing the local scale-scale correlations
(Feng & Fang 2000). Clearing the local scale-scale correlation is key to an effective
Gaussianization reconstruction. All these features provide a solid basis for the DWT
scale-by-scale (or scale-adaptive) Gaussianization reconstruction algorithm.

Since galaxies undergo a highly non-linear evolution, and their bias might be non-monotonic,
the validity of the order preserving assumption for the galaxy field needs further
study. Whether a sample is suitable for the DWT scale-by-scale reconstruction can be
judged by detecting the non-local mode-mode correlations. If the scale-scale correlations
are significantly non-local, the sample will not be a good candidate for a DWT scale-by-scale
Gaussianization. Local scale-scale correlations have been detected in recent years
for galaxy samples, such as the APM bright galaxy catalog (Feng, Deng & Fang 2000).
However, it remains to be determined whether these couplings are non-localized.

In essence, the DWT analysis looks at the phase space behavior of the gravitational 
clustering. While there are many perturbation calculations of the clustering dynamics in the 
Fourier and coordinate representations (Buchert 1993; Bouchet et al. 1995; Catelan 1995),
the locality cannot be revealed with these calculations, because in the Fourier representation
the phases hold the position information (locality) of the relevant Fourier modes. This makes
it practically impossible to determine the locality of clustering in the Fourier representation.
The DWT analysis may also be useful in studying other clustering features in the Zel'dovich
approximation for which a phase space description is essential, such as the breakdown of
the Galilean invariance or the lack of momentum conservation (Polyakov 1995; Scoccimarro
1998).

This work was supported in part by the LWL foundation. We thank Dr. D. Tytler for
kindly providing the data of the Keck spectrum HS1700+64. LLF acknowledges support 
from the National Science Foundation of China (NSFC) and World Laboratory Scholarship. 

Fig. 1. — A simulated sample of the Lyα forest, including the flux $F$, the IGM density
contrast $\delta_{IGM}$, peculiar velocity $V_{pec}$ and column density of neutral hydrogen $N_{HI}$.

Fig. 2. — The IGM density field (left panel) and flux (right panel). The solid lines are for
the reconstructed density field and flux by the scale-by-scale Gaussianization. The dotted 
lines are the original fields. 

Fig. 3. — The DWT power spectra of the mass fields recovered by the scale-by-scale
algorithm. The error bars are the 1σ deviation calculated for the 100 realizations.

Fig. 4. — The dependence of $\kappa_{j,j}(\Delta l)$ on $\Delta l$. $\kappa_{j,j}(\Delta l)$ is normalized to $\kappa_{j,j}(0) = 1$, and the
spatial distance between the two modes $(j, l)$ and $(j, l')$ is $D = L\Delta l$, where $L$ is the length
of the sample. The results are obtained from 100 realizations of simulated Lyα transmitted
flux. The central line is the mean and the upper and lower lines show the 95% confidence
level. The result for QSO HS1700+64 is displayed as the solid squares.

Fig. 5. — The $\Delta l$-dependence of the correlation $\kappa_{j,j+1}(\Delta l)$, where $\kappa_{j,j+1}(\Delta l)$ is normalized to
$\kappa_{j,j}(0) = 1$, and the spatial distance between the two modes $(j, l)$ and $(j + 1, l')$ is $D = L\Delta l$,
where $L$ is the length of the sample. The results are obtained from 100 realizations of
simulated Lyα transmitted flux. The central line is the mean and the upper and lower lines
show the 95% confidence level. The result for QSO HS1700+64 is displayed as the solid
squares.

Fig. 6. — $\kappa_{j,j}(\Delta l)$ for the moderate resolution data of the Lyα forest lines LWT/JB. The
case of $l = l'$, or $\Delta l = 0$, is the localized case, and $l' = l + 1$ and $l' = l + 2$ show the non-local
correlations. $\kappa_{j,j}(\Delta l)$ is calculated for each QSO absorption line system and the 1-σ error bars are
determined from the variance of the data consisting of all the systems.

Fig. 7. — Same as figure 4, but for the HU/KT high resolution absorption samples of Lyα
forest lines.

Fig. 8. — The correlation $\kappa_{j,j+1}(\Delta l)$ vs. $\Delta l = l' - l$ for the moderate resolution data of
Lyα forest lines LWT/JB. As expected, there is no power in this second order scale-scale
correlation at any scale. $\kappa_{j,j+1}(\Delta l)$ is calculated for each QSO absorption line system and
the 1-σ error bars are determined from the variance of the data consisting of all the systems.

Fig. 9. — Same as figure 5 but now using the high resolution data HU/KT. 

Fig. 10. — The 4th-order correlation $Q_j^{2,2}(\Delta l)$ from 100 realizations of simulated Lyα
transmitted flux and QSO HS1700+64. The spatial distance between the two modes $(j, l)$ and
$(j, l')$ is $D = L\Delta l$, where $L$ is the length of the sample. The central line is the mean and the
upper and lower lines show the 95% confidence level of the simulation samples. The result for
the QSO HS1700+64 is displayed as the solid squares.



Fig. 11. — The 4th-order scale-scale correlation $C_j^{2,2}(\Delta l)$ from 100 realizations of simulated
Lyα transmitted flux and QSO HS1700+64. The spatial distance between the two modes
$(j, l)$ and $(j + 1, l')$ is $D = L\Delta l$, where $L$ is the length of the sample. The central line is the
mean and the upper and lower lines show the 95% confidence level of the simulation samples.
The result for the QSO HS1700+64 is displayed as the solid squares.



Fig. 12. — The 4th-order scale-scale correlations, $C_j^{2,2}(\Delta l)$, for the LWT/JB data. $C_j^{2,2}(\Delta l)$
is calculated for each QSO absorption line system and the 2-σ error bars are determined from the
variance of the data consisting of all the systems.



Fig. 13. — The 4th-order scale-scale correlations, $C_j^{2,2}(\Delta l)$, for the HU/KT data. $C_j^{2,2}(\Delta l)$
is calculated for each QSO absorption line system and the 2-σ error bars are determined from the
variance of the data consisting of all the systems.